Back

JMIR Medical Informatics

16 training papers 2019-06-25 – 2026-03-07

Top medRxiv preprints most likely to be published in this journal, ranked by match strength.

1
Development and validation of an algorithm to identify front-line clinicians using EHR audit log data
2026-02-16 health informatics 10.64898/2026.02.13.26346268
Top 0.2% (4.0%)
Show abstract

BackgroundInterprofessional teams are central to high quality patient care. However, identifying the clinician primarily responsible for a patient requires labor-intensive methodologies. Although electronic health record (EHR) audit logs offer a scalable alternative, its use for identifying frontline clinicians is underdeveloped. ObjectiveTo develop and validate an algorithm utilizing EHR audit logs to identify the primary frontline clinician per patient day of an encounter and to describe care...

2
Identifying Reasons for ACEI/ARB Non-Use in CKD Using Scalable Clinical NLP with Schema-Guided LLM Augmentation
2026-02-12 health informatics 10.64898/2026.02.10.26346025
Top 0.2% (3.9%)
Show abstract

IMPORTANCEAlthough angiotensin-converting enzyme inhibitors (ACEIs) and angiotensin receptor blockers (ARBs) are recommended for people with chronic kidney disease (CKD), they remain underused. Barriers to adherence, such as adverse effects or patient refusal, are frequently embedded within unstructured clinical narratives and are therefore inaccessible to structured data analytics. Scalable natural language processing (NLP) approaches are needed to identify these barriers and support guideline-...

3
Can Machine Learning Algorithms use Contextual Factors to Detect Unwarranted Clinical Variation from Electronic Health Record Encounter Data during the Treatment of Children Diagnosed with Acute Viral Pharyngitis
2026-03-02 health informatics 10.64898/2026.02.23.26346757
Top 0.3% (3.8%)
Show abstract

Rationale, Aims and ObjectivesUnwarranted clinical variation (UCV) in patient care often arises from contextual factors and contributes to increased costs, unnecessary treatments, and deviations from evidence-based practice. Detecting UCV is challenging due to the complexity of care decisions. Current approaches rely on centralized data aggregation and mixed-effects regression, which estimate relative variation but cannot detect absolute variation. Moreover, machine learning (ML) methods leverag...

4
Variability in Automated Sepsis Case Detection: A Systematic Analysis of Implementation Methods in Clinical Data Repositories
2026-03-04 health informatics 10.64898/2026.02.27.26347259
Top 0.3% (3.8%)
Show abstract

ObjectiveTo systematically identify and characterize methodological heterogeneity in sepsis case detection methods using the MIMIC-III database or the eICU-CRD, and to quantify the resulting variability in sepsis detection rates. Materials and MethodsWe conducted a PRISMA-guided systematic review of PubMed and Web of Science (2016-2024), and stratified studies by cohort definition to obtain comparable subsets. We extracted information on sepsis case detection methodology across six domains: par...

5
Patient-Centric Markov-Chain Framework for Predicting Medication Adherence Using De-Identified Data
2026-02-10 health informatics 10.64898/2026.02.08.26345856
Top 0.6% (3.7%)
Show abstract

Long-term adherence to prescribed therapies remains a persistent challenge in chronic and ultra-rare conditions where clinical outcomes depend on continuous medication use. Even brief gaps in therapy can compromise disease control, yet patients frequently encounter structural barriers including high out-of-pocket costs, prior-authorization (PA) delays, annual re-verification cycles, and refill logistics that disrupt persistence. This study evaluates a patient-centric Markov-chain framework for a...

6
Development and Validation of the Intensive Documentation Index for ICU Mortality Prediction: A Temporal Validation Study
2026-02-12 health informatics 10.64898/2026.02.10.26345827
Top 0.6% (3.7%)
Show abstract

BackgroundNursing documentation patterns may reflect patient acuity and clinical deterioration, yet their prognostic value remains underexplored. We developed the Intensive Documentation Index (IDI), a novel framework quantifying temporal documentation rhythms, and evaluated its ability to enhance ICU mortality prediction.58 MethodsWe analyzed 26,153 ICU admissions of heart failure patients from the MIMIC-IV database (2008-2019). Nine IDI features capturing documentation rhythm, volume, and sur...

7
Data-Driven Hybrid Model of SARIMA-CNNAR For Tuberculosis Incidence Time Series Analysis in Nepal
2026-02-24 health informatics 10.64898/2026.02.22.26346853
Top 0.6% (3.7%)
Show abstract

BackgroundTuberculosis (TB) remains a major public health challenge in Nepal, with incidence rates substantially higher than global estimates. Accurate forecasting of TB incidence is essential for early warning systems, resource allocation, and targeted interventions. This study aimed to develop and validate a hybrid Seasonal Autoregressive Integrated Moving Average (SARIMA) and Convolutional Neural Network Auto-Regressive (CNNAR) model for TB incidence forecasting in Nepal. MethodsMonthly TB i...

8
The Causal Impact of Natural Language Processing-Driven Clinical Decision Support on Sepsis Mortality in England: An Augmented Synthetic Control Analysis of NHS Trust-Level Data
2026-03-02 health informatics 10.64898/2026.02.27.26347253
Top 0.6% (3.6%)
Show abstract

BackgroundSepsis remains a leading cause of preventable hospital mortality in England, with NHS England reporting over 48,000 sepsis-related deaths annually. Natural language processing (NLP)-driven clinical decision support systems (CDSS) have been deployed in several NHS Trusts to enable automated early detection of sepsis from unstructured clinical notes, yet causal evidence of their effectiveness at the hospital level remains limited. ObjectiveTo estimate the causal effect of implementing N...

9
Federated penalized piecewise exponential model for horizontally distributed survival data: FedPPEM
2026-02-12 health informatics 10.64898/2026.02.11.26346054
Top 0.7% (3.1%)
Show abstract

Cox proportional hazard regressions are frequently employed to develop prognostic models for time-to-event data, considering both patient-specific and disease-specific characteristics. In high-dimensional clinical modeling, these biological features can exhibit high collinearity due to inter-feature relationships, potentially causing instability and numerical issues during estimation without regularization. For rare diseases such as acute myeloid leukemia (AML), the sparsity and scarcity of data...

10
Augmenting Electronic Health Records for Adverse Event Detection
2026-02-11 health informatics 10.64898/2026.02.10.26345962
Top 0.8% (2.9%)
Show abstract

ObjectiveAdverse events (AEs) resulting from medical interventions are significant contributors to patient morbidity, mortality, and healthcare costs. Prediction of these events using electronic health records (EHRs) can facilitate timely clinical interventions. However, effective prediction remains challenging due to severe class imbalance, missing labels, and the complexity of EHR records. Classical machine learning approaches frequently underperform due to insufficient representation of minor...

11
Thyroid Cancer Risk Prediction from Multimodal Datasets Using Large Language Model
2026-03-06 health informatics 10.64898/2026.03.05.26347766
Top 1% (2.7%)
Show abstract

Thyroid carcinoma is one of the most prevalent endocrine malignancies worldwide, and accurate preoperative differentiation between benign and malignant thyroid nodules remains clinically challenging. Diagnostic methods that medical practitioners use at present depend on their personal judgment to evaluate both imaging results and separate clinical tests, which creates inconsistency that leads to incorrect medical evaluations. The combination of radiological imaging with clinical information syst...

12
Early Detection of Absurdity Signals in Pharmacovigilance: A Machine Learning Ensemble Approach to Identify Rare Adverse Drug Reactions
2026-02-09 health informatics 10.64898/2026.02.06.26345783
Top 1% (2.7%)
Show abstract

BackgroundTraditional pharmacovigilance methods based on biostatistical approaches systematically exclude outliers and rare events, potentially missing critical safety signals. These methods fail to detect micro-clusters of adverse events and comorbidity patterns that may indicate serious but low-frequency adverse drug reactions (ADRs). We introduce the concept of absurdity signal detection - the identification of statistically anomalous but clinically significant adverse event patterns that co...

13
Patient-centric radiology: Utilising large language models (LLMs) to improve patient communication and education
2026-02-25 health informatics 10.64898/2026.02.23.26346923
Top 1% (2.7%)
Show abstract

PurposeTo evaluate whether large language models (LLMs) can enhance clinician-patient communication by simplifying radiology reports to improve patient readability and comprehension. MethodsA randomised controlled trial was conducted at a single healthcare service for patients undergoing X-ray, ultrasound or computed tomography between May 2025 and June 2025. Participants were randomised in a 1:1 ratio to receive either (1) the formal radiology report only or (2) the formal radiology report and...

14
MedOS: AI-XR-Cobot World Model for Clinical Perception and Action
2026-02-23 health informatics 10.64898/2026.02.18.26345936
Top 1% (2.1%)
Show abstract

Medicine historically separates abstract clinical reasoning from physical intervention. We bridge this divide with MedOS, a general-purpose embodied world model. Mimicking human cognition via a dual-system architecture, MedOS demonstrates superior reasoning on biomedical benchmarks and autonomously executes complex clinical research. To extend this intelligence physically, the system simulates medical procedures as a physics-aware model to foresee adverse events. Generating and validating on the...

15
Ed-Triage-Agent: A Framework For Human-Ai Collaborative Emergency Triage
2026-02-18 health informatics 10.64898/2026.02.17.26346501
Top 1% (2.1%)
Show abstract

AO_SCPLOWBSTRACTC_SCPLOWEmergency Department triage is a critical decision-making process in which clinicians must rapidly assess patient acuity under high cognitive load and time pressure. We present ED-Triage-Agent (ETA), a multi-agent AI framework designed to augment clinical decision-making in Emergency Severity Index (ESI) classification through human-AI collaboration. The system operates in two phases: (1) autonomous patient intake via a conversational agent that collects structured sympto...

16
Improvement in Albuminuria Screening Associated with EHR Decision Support Change
2026-02-14 health informatics 10.64898/2026.02.09.26345709
Top 1% (2.1%)
Show abstract

BackgroundAlbuminuria is associated with increased risk of cardiovascular disease (CVD), heart failure, and progression of chronic kidney disease (CKD). Early detection of albuminuria, done through spot urine albumin creatinine ratio (UACR) testing, enables more accurate risk stratification and timely use of preventative therapies. It remains unacceptably low in the hypertension population. MethodsWe evaluated two EHR-embedded clinical decision support (CDS) strategies at Geisinger Health Syste...

17
Class imbalance correction in artificial intelligence models leads to miscalibrated clinical predictions: a real-world evaluation
2026-03-05 health informatics 10.64898/2026.03.04.26347634
Top 1% (2.1%)
Show abstract

BackgroundPredictive models employing machine learning algorithms are increasingly being used in clinical decision making, and improperly calibrated models can result in systematic harm. We sought to investigate the impact of class imbalance correction, a commonly applied preprocessing step in machine learning model development, on calibration and modelled clinical decision making in a large real-world context. MethodsA histogram boosted gradient classifier was trained on a highly imbalanced na...

18
Show Your Work: Verbatim Evidence Requirements and Automated Assessment for Large Language Models in Biomedical Text Processing
2026-03-04 health informatics 10.64898/2026.03.03.26346690
Top 1% (2.1%)
Show abstract

PurposeLarge language models (LLMs) are used for biomedical text processing, but individual decisions are often hard to audit. We evaluated whether enforcing a mechanically checkable "show your work" quote affects accuracy, stability, and verifiability for trial eligibility-scope classification from abstracts. MethodsWe used 200 oncology randomized controlled trials (2005 - 2023) and provided models with only the title and abstract. Trials were labeled with whether they allowed for the inclusio...

19
ChatGPT with Mixed-Integer Linear Programming for Precision Nutrition Recommendations
2026-02-17 health informatics 10.64898/2026.02.14.26346312
Top 1% (2.0%)
Show abstract

BackgroundThe growing interest in applying artificial intelligence in personalized nutrition is challenged by the complex nature of dietary advice that must balance health, economic, and personal factors. Though automated solutions using either Linear Programming (LP) or Large Language Models (LLMs) already exist, they have significant drawbacks. LP often lacks personalization, whereas LLMs can be unreliable for precise calculations. ObjectivesTo develop and assess a model that integrates a Mix...

20
Bias in respiratory diagnoses by Large Language Models (LLMs) in Low Middle Income Countries (LMICs)
2026-03-03 health informatics 10.64898/2026.03.02.26347405
Top 1% (2.0%)
Show abstract

IntroductionClinicians and patients are likely to increasingly use Large Language Models (LLMs) for diagnostic support. Use of LLMs mostly created in North America and Europe, could lead to a High-Income Country bias if used in Low- and Middle-Income Country (LMIC) healthcare settings. We aimed to explore if diagnostic suggestions made by LLMs are relevant in LMIC settings. MethodsFive short respiratory clinical vignettes were produced. For each vignette, a group of doctors from one of 5 countr...